AN APPROXIMATION INEQUALITY FOR CONTINUED RADICALS 
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Abstract. In this article we derive an approximation inequality for continued radicals, 
generalizing an inequality of Herschfeld [3] for continued square roots to arbitrary radicals, 
which is useful in exploring convergence issues and obtaining convergence rates. 



1. Introduction 

Continued radicals aka infinitely nested radicals are infinite constructs of the forms 



(1) {/ai+ r i/a 2 + 

or 



(2) ••• r ya 3 + Y<*2+ Vol, 

the first one being a "right" radical and the second one "left" . Following Jones [3] let us call 



'tH + r ya 2 + V a 3 + • • • + or r W a n + • • • ^3 + r ^/a 2 + yaT the n th approxi- 

mants of the radicals ([!]) or ([2]). For left radicals, there is a recurrence relation connecting 
the successive approximants. Right radicals lack this property and to compute their ap- 
proximants one needs to start from the tail-end, i.e., r ^/a^ at each step (as Jones [1] puts 
it, they have an end but no beginning!). This and the inherent non-linearity complicates 
their analysis considerably and perhaps for that reason continued radicals have not been 
investigated as thoroughly as their other infinite counterparts like infinite series, infinite 
products or continued fractions. Moreover almost all the existing results assume the a n to 
be non- negative (however, see the amusing article [ID], where the authors consider certain 
mixed type square roots, and establish a number of beautiful formulas). Continued radicals 
with negative or more generally complex a n remain in a little-explored territory. 

In this article we shall consider continued right radicals with non-negative input sequence 
{a n }. Convergence questions about such radicals appear in Polya and Szegd's classic [6j. 
To fix notations let 



(3) u n = \ ai + \ a 2 H h 
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be the n th approximant to the the right continued square root 



(4) yai + ya2 + V«3 + --- 

with non-negative input sequence {a n }. More generally let 



(5) v n = {/ai + r \ja 2 + ■■■+ 

be the n th approximant to the general continued right radical 



(6) r \ja 1 + r ya 2 + r v / a 3 + --- 

where again the inputs a n are non-negative reals and r n are positive integers. We make a 
note that {v n } is a monotonic non-decreasing sequence of non-negative reals. Problem 162 
of [6] considers the sequence {u n } with a n > for all n. It states the following criterion for 
convergence / divergence. 

Proposition 1.1. Let 

log log a n 
iimsup = a. 

n— >oo Tl 

Then {u n } converges if a > 2 and diverges if a < 2. 

Here if a n < 1 for some n then Polya and Szego's convention is to interpret log 1 ° g a " as 
— oo. 

Problem 163, a continuation of Problem 162, asks the reader to show the following 
Proposition 1.2. The sequence {u n } converges if the series 

oo 

^2~"a n (ai • • -a„)~2 

n=l 

converges. 

Proposition 11.21 follows from the following inequality (see [21 16]). 
Proposition 1.3 (Polya-Szego). For each n > 1 we have 



(7) u n+ i -u n < 



2 n yfaiy/a2~ ■ ■ ■ 

Herschfeld [3], in a comprehensive study, established necessary and sufficient conditions 
for the convergence of both right and left continued square roots. As we are interested with 
right radicals only, we reproduce Herschfeld's criterion for right square roots. 

Theorem 1.1 (Herschfeld, 1935). The sequence {u n } converges if and only if 

Inn sup a n < oo. 

n— >oo 

It is a good exercise in classical real analysis to establish Theorem 11.11 and show that it 
encompasses the Polya-Szego criterion in Proposition !!.!! See [UI31[5l|9] for other interesting 
and useful convergence criteria, including a rediscovery of Herschfeld's criterion in |9j. 

In [3] Herschfeld derived an inequality stronger than the one in Proposition 11.3! We 
present a slightly modified statement here. 
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Theorem 1.2 (Herschfeld, 1935). We have for each n > 1, 
(8) u n+ i -u n < 



2 n y / ai H y/a^^a 2 H h 

where of course all the a n are assumed to be positive. 



Prom Theorem 1 1 . 2 1 one readily obtains Proposition 11.31 because \Jai + • • • ^fa~^ > ^Jai for 
each j = 1, . . . , n. Herschfeld used the following elementary inequality repeatedly to obtain 
the inequality in Theorem 11.21 

(9) ya + x < y/a H ==, where a > < x. 

2yja 

Theorem 11.21 may be used to infer about the rate of convergence of the sequence {u n }. 

2. An Inequality for General Continued Radicals 

In this section we shall obtain an inequality similar to the one in Theorem 11.21 for the 
more general sequence {v n } defined in ([5]). This will yield Theorem 11.21 as a special case. 
The first thing to observe is this: since we are considering general right radicals, we may 
well assume, with no loss of generality, that a n > for all n. Let us give an example to 
illustrate this point. Suppose 01,03,04 > and a 2 = 0. Clearly, we may write 



y 01 + r y a 2 + r y a3 + yai+ ••• = y a [ + + + 

where r[ = r\, a\ = a%, r' 2 = r 2 r 3 , a' 2 = 03, and for i > 3, r[ = r' i+1 , = a.i + \. It is 
evident that this procedure may be performed suitably to eliminate all zero inputs from 
consideration without changing the "value" of the radical. So, henceforth we will assume 
that a n > for all n. 

It is an interesting fact that continued right radicals have a triangular- array type structure 
when it comes to computation. Indeed one can construct a triangular array {tj,n}i<«<n, n>i 
where 



(10) ti,n = n+1 "V On+l-i + • • • + 

so that the diagonals {t ra ,n} form the sequence of the n th approximants. To compute t n>n 
one starts with t\ n = r \fa^ and successively computes t i>n = n+1 - v / a n+ i_ i + ^_i, n for 
i = 2, ...,n. To relate v n = t n ^ n to t± jTl , we need a device to invert this sequence of 
operations. This motivates us to introduce certain "deradicating" functions as follows. Let 
fo(y) '■= V and for k > 1 recursively define 

(11) fk{y) ■■= f k -i(y) rk - o fc . 

The deradicating functions fj map the diagonal term t n ^ n to the preceding terms of the n th 
row of the array, i.e. one has fj{tn,n) — tn—j,n lor j — 1, . . . , n — 1. With these functions in 
hand, we are able to state the following generalization of the inequality in Theorem 11.21 

Theorem 2.1. For n > 1, 

r n+0/a rt _|_l 

(12) v n+ i -v n < — . 



II nlh-iivn)}^- 1 

i=l 
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When n = T2 = • • • = r, say, this becomes 



(13) f n +l - «n < — 



^ni/.-iMr- 1 



In order to prove Theorem 12.11 we need the following key identity. 
Lemma 2.1. For n > 1, 



(14) f n +l - V n - 



n n—1 

EKE /i-i(«rH-i) J 7i-iK) r *- 1 - J ') 

i=l j=0 



Proof. As already observed, we have 



(15) /j(Wfi) = *n-j,n = rj+ |/ Oj+i H h Va^, for j = 1, . . . , n - 1. 

Similarly, 

Therefore, 

/«-l(Wn+l) r " - /n-l(^n) r ™ = r "+V a "+l 

i.e. 

(16) (/ n _l(u n+ l) - / n _i («„))( ^ /™-l(w„+l) : '/n-l(Wn)' rn_1 ^) = r "+V^+l- 

i=o 

Now using the identity 

Ti-l 

fi(x) - My) = fi-i(x) r * - fi-xW* = - /< :!•'•)•'./- :!//)" ' j ) 

j=0 

repeatedly to the LHS of (jT6|) we get 

n rj—1 

(/oK+i) - /oK)) II£ / ! -i(fn+i) J /i-iK) r, " 1 " : ') = r "+V^TT. 

i=l j=0 

Noting that /o(f ra +i) — fo{v n ) = v n+ \ — v n one finally obtains the desired identity. □ 

Proof of Theorem \2.1\ First note that the fi are increasing functions of y (proof: /o is 
increasing and the definitions of the fi may be used in an induction argument). Also clearly 
v n +i > v n . Therefore, fi(v n+ i) > fi(v n ) for each i = 0, 1, . . . , n — 1 and hence, 

n-i 

E /w(w"/wKr H > ri/w^r 1 . 

j'=o 

Combining this observation with identity (|14p the proof is complete. □ 

Theorem 12.11 yields the following weaker inequality which can be considered as a gener- 
alization of Corollary 11.31 
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Corollary 2.1. For n > 1 



(17) Un+1 - W n < — — r . 

n ^ r 



i=i 



Proof. The identities in (fT5j) imply that for i = 0, 1, . . . , n — 1 one has fi(v n ) > r i+^ai + \. 
Using this in Theorem 12.11 completes the proof. □ 

Specializing to the case r% = r2 = ■ ■ ■ = 2 one recovers Theorem 11.21 and Corollary 11.31 
immediately. It is a curious fact that for n = 1, the radicals become infinite series, and the 
inequality of Corollary 12.11 reduces to the (sharpest possible!) statement that a n +i < a n+ \. 

Following Corollary 12.11 Proposition 11.21 generalizes in a natural way to 

Theorem 2.2. The sequence {v n } converges if the series 

n=l \i=l / 

converges. 

Proof. We show that under the hypothesis {v n } is Cauchy. Let us denote the partial sums 
of the series S by S n with Sq := 0. As {v n } is a strictly increasing sequence, we have for 
m > n > 1, 

m— 1 



< V m - V n = ^ ( v k+l ~ v k 
k=n 



k=n 

m—1 / k r.'-lA 1 



k=n \i=l / 

oo / k _1 

< ^2 rk +yak+i I n«j ri J 

k=n \i=l / 

= S — S n —i, 

which is the tail sum of the convergent series S. Hence, given any e > 0, we can choose 
N > 1 sufficiently large such that for all n > N we have S — S n -\ < e. Then for any 
m > n > N, we have 

< v m - v n < S - S n -i < e. 
This completes the proof. □ 

3. Further Generalizations 
Theorem 12 . 1 1 can be generalized to include a more general class of continued right radicals: 



(18) w n = bi y oi + b 2 r ya 2 + ■ ■ ■ + b n 

where the a,j, bj are positive reals. Note that one can reduce w n to the form v n by bringing 
the bj "inside" , i.e. one can write 



(19) Wn = {/ci + r y C2 + • • • + yH n , 



6 SOUMENDU SUNDAR MUKHERJEE 

where Cj := b r 1 1 ' r ^2 2r ' ■■■b r i 'a,i. Now Theorem 12.11 (Corollary l2TTj) can be applied to w n . 
For computational purposes the following recurrence relation is useful: 

(20) Cj = cii ( 1 1 % | , for i > 1, and we define = bo = 1. 

Another way to obtain an inequality would be to follow the proof of Theorem 12.11 and 
define the deradicating functions fj suitably. This approach gives a simpler expression for 
the bound in terms of the aj , bj . If we define /o (y) = y and for k > 1 

f , v f fk-i(y) Y k 

and proceed as in Lemma l2.1| we have the following 
Lemma 3.1. For n > 1 we have 

n 

b„+i r «+ya ri+ i n & ? 

(21) w n+ i - w n = -; — 

ft E/i-i(%f 1 ) } 7i-iW ,, - H 

i=i \j=o J 

of which Lemma 12.11 is a special case. In the spirit of Theorem 12.11 this identity yields 
the following 

Theorem 3.1. For n > 1 we have 

n 

b n+ i r n+^/a n+1 n 

(22) w n+ i -w n < — — — . 



i=i 



The corresponding Polya-Szego type version is 
Corollary 3.1. For n > 1, 



(23) w n+1 -w n < 



i=l 



r.— 1 



n r * 



i=l 



On taking bj = 1 in (|22|) -(|23 p one recovers Theorem 12.11 and Corollary 12.11 respectively. 
Generalization of Theorem 12.21 for the sequence {w n } is a straightforward exercise. 

Theorem 3.2. The sequence {w n } converges if the series 

oo n , r ,-u -1 

S = ^2 b n+1 r n+^a n+ i Y[ h I r i a i n J 



n=l i=l 

converges. 
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4. Some Examples 

Example 4.1. Consider the case r\ = r-i = ■ ■ ■ = r > 2. Let a\ = 03 = • • • = a > 0, 
&X = b% = ■ ■ ■ = b > 0. Then we have the following infinite radical 



(24) b{Ja + b</a + ---. 

This type of radicals with r = 2 have been considered in |10j , where the authors derive many 
interesting properties of these radicals (e.g. representation of rationals by such radicals). 
Using inequality \23\) we have 

< w n+ \ — w n < cs n , with c = ba^ and s = 



r-l 

ra r 



Suppose a, b, r are such that s < 1. So, we have for m > n > 1, 

m—l 

<w m -w n ='^2 {wk+i - w k ) 

k=n 
m—l 

k=n 

00 n 

<cs n J2 sk - CS 



1 - s 

k=0 

So, {w n } is Cauchy and hence convergent. Denoting the limit by w and letting m — > 00 we 
obtain 

cs n 

< w — w n < . 

1 — 8 

Thus the convergence rate is at least geometric. 
Example 4.2. Let us consider the radical 



(25) 1 + ]j2+y3+^I+ TT: . 

Here r n = a n = n. From Corollary \2.1\ we have 



n+1 j 

, (n+i)^r n *' (3 i r+ i 

< v n+1 -v n < = » < 2 . 

, A -i-l (n\Y (n\) 2 
n! 

i=l 



i=l 

For n > 2 one may use the weaker estimate 



1 

33 

V n +l ~V n < —. 

n\ 

n 

Using this and the well-known estimate e— ^ ^ < ^7 (see, for example, [Sjj, it is an easy 

k=0 

exercise to show that for n > 2, 

1 

33 

< v — v„ < 



(n-l)(n-l)!' 
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A slightly modified version of the above radical is the interesting looking 



(26) l + 2y2 + 3\/3 + 4^4 + ---. 

For this radical a n = b n = r n = n and inequality A23\) yields 



n+l 



( ra + !)!(„ + + fi*' (n + l)(3l)"+ 1 

< U>n+1 n = i - 



1 

n! n ^ T 

i=i 

For n > 4, n! > 2 n . So, for such n we have 



w n+ i -w n <3a(n+ 1) 



n! n\ 



■ i \ n 

33 \ 



1 

Writing s = < 1, we Ziaue /or m > n > 4, 

m— 1 



i 2s n+1 
< w m - w) n < 33 (i + ly < 33 + l)s l = [1 + n(l - s)] 

Hence, for n > 4, 



< 10 - w„ < t; ^ [1 + n(l - s)]. 



2s n+1 

The estimates in Example 14.21 are crude and used for illustrative purposes only. For 
example we have used the well known fact that the sequence { {/re} achieves its maximum 
at n = 3. Better estimates are available, e.g., for all re > 1 one has 

i 1 

n« < 1 + 

In 



Note also that we have indirectly verified the hypothesis of Theorem l3.2l in both the examples 
above. 

Example 4.3. Our last example demonstrates that the bound provided by the generalized 
Polya-Szego type inequality i23\) is not quite sharp. It fails to give nontrivial bounds for the 
famous Ramanujan radical [7] 



(27) y l + 2^1 + 3Vl + ---. 

In 1911, the great Indian mathematician Srinivasa Ramanujan posed the question to find 
its value in the Journal of the Indian Mathematical Society. When no answer arrived after 
six months, he published the solution. With an ingenious manipulation he "showed" that 
the value of the above radical is 3. Ramanujan 's argument was incomplete, but the answer 
was indeed correct. For a rigorous proof see [3] and also [2] for further commentary on this 
radical. In our notation this radical has a n = 1, b n = n and r n = 2. So inequality 
yields 

(n + l)! 

w n+ i -w n < — — — , 

which is clearly trivial. 
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This failure is inherent in inequality ()23p itself. Heuristically one expects that whenever 
the b n are "large" compared to the a n , the bound provided by inequality (f23|) will be less 
precise. The last two examples of this section support this heuristic. 

5. Discussions and Questions 

The most crucial part in the proof of Theorem 12.11 is identity (|14p . One can get stronger 
inequalities than the one in Theorem 12. II bv using better lower bounds for the denominator 
in the RHS of identity (|14p . but they will be even more unwieldy. As is expected, the 
weaker inequality of Corollary 12.11 offers the most tractable estimates. Although it (and its 
more general relative (|23p ) does not give nontrivial estimates always, as seen in the previous 
section. Theorem 12.11 might help in some special problems but in general it does not give 
manageable expressions. 

Question 1: What would be a general strategy to obtain nontrivial (and manageable) 
upper bounds for w — w n when the b n are "large" compared to the a n ? 

Continued radicals generalize to continued power forms 

(28) ( ai + ( a2 + ( a3 + ...)P3) P2) p 1) 

where the pj are allowed to take any real value. Herschfeld [3J stated a generalized version 
of his convergence criterion for such power forms when pj £ (0, 1]. 

Theorem 5.1 (Herschfeld, 1935). Let 

(29) t n = (Qi + (o 2 + ... + o^ff, 
where pj £ (0, 1] for each j > 1 and suppose that the series 

oo 

S = ^Pi ■ -Pi 

1=1 

converges. Then the sequence {t n } converges if and only if 

limsupa^ 1 '" p " < oo. 

n— >oo 

Jones |D developed a ratio test type sufficient condition for the convergence of continued 
power forms with p\ = p% = ■ ■ ■ = p > 1. It would be an interesting problem to find 
approximation inequalities similar to the one in Theorem 12. II for these general power forms 
(at least when pj G (0,1)). However, note that the proof of Theorem 12.11 works only for 
positive integral rj. 

Question 2: Can Theorem l2.1l be suitably generalized to encompass the continued power 
form (I28D ? 

6. Acknowledgements 
The author is thankful to Prof. Debapriya Sengupta for many useful discussions. 



10 



SOUMENDU SUNDAR MUKHERJEE 



References 

[1] J. M. Borwein, G. de Barra, The American Mathematical Monthly, Vol. 98, No. 8 (Oct., 1991), pp. 
735-739. 

[2] B.C. Brendt, Y. Choi, S. Kang, Contemporary Mathematics, Vol. 00, 1997, pp. 6-7. 

[3] A. Herschfeld, The American Mathematical Monthly, Vol. 42, No. 7 (Aug.-Sep., 1935), pp. 419-429. 

[4] D. J. Jones, Fibonnaci Quarterly 29 (1991), pp. 37-46. 

[5] D. J. Jones, Mathematics Magazine, Vol. 68, No. 5 (Dec, 1995), pp. 387-392. 
[6] G. Polya, G. Szego, Problems and Theorems in Analysis I, Springer- Verlag, 1978. 
[7] Question 289, Journal of the Indian Mathematical Society III (1911) 90. 
[8] W. Rudin, Principles of Mathematical Analysis, Third edition, McGraw-Hill, 1976. 
[9] W. S. Sizer, Mathematics magazine, Vol. 59, No. 1 (Feb., 1986), pp. 23-27. 
[10] S. Zimmerman, C. Ho, Mathematics Magazine, Vol. 81, No. 1 (Feb., 2008), pp. 3-15 

Applied Statistics Unit, Indian Statistical Institute, Kolkata 700108 
E-mail address: soumendu041@gmail.com 



